Multi-granularity for knowledge distillation

نویسندگان

چکیده

Considering the fact that students have different abilities to understand knowledge imparted by teachers, a multi-granularity distillation mechanism is proposed for transferring more understandable student networks. A self-analyzing module of teacher network designed, which enables learn from teaching patterns. Furthermore, stable excitation scheme robust supervision training. The can be embedded into frameworks, are taken as baselines. Experiments show improves accuracy 0.58% on average and 1.08% in best over baselines, makes its performance superior state-of-the-arts. It also exploited student's ability fine-tuning robustness noisy inputs improved via mechanism. code available at https://github.com/shaoeric/multi-granularity-distillation.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Granularity in Multi-Method

Multi-method planning is an approach to using a set of different planning methods to simultaneously achieve planner completeness, planning time efficiency, and plan length reduction. Although it has been shown that coordinating a set of methods in a coarse-grained, problem-by-problem manner has the potential for approaching this ideal, such an approach can waste a significant amount of time in ...

متن کامل

Sequence-Level Knowledge Distillation

Neural machine translation (NMT) offers a novel alternative formulation of translation that is potentially simpler than statistical approaches. However to reach competitive performance, NMT models need to be exceedingly large. In this paper we consider applying knowledge distillation approaches (Bucila et al., 2006; Hinton et al., 2015) that have proven successful for reducing the size of neura...

متن کامل

Knowledge Distillation for Bilingual Dictionary Induction

Leveraging zero-shot learning to learn mapping functions between vector spaces of different languages is a promising approach to bilingual dictionary induction. However, methods using this approach have not yet achieved high accuracy on the task. In this paper, we propose a bridging approach, where our main contribution is a knowledge distillation training objective. As teachers, rich resource ...

متن کامل

Knowledge Granularity and Action Selection

In this paper we introduce the concept of knowledge granularity and study its in uence on an agent's action selection process. Action selection is critical to an agent performing a task in a dynamic, unpredictable environment. Knowledge representation is central to the agent's action selection process. It is important to study what kind of knowledge the agent should represent and the preferred ...

متن کامل

Multi-Granularity Noise for Curvilinear Grid LIC

A major problem of the existing curvilinear grid Line Integral Convolution (LIC) algorithm is that the resulting LIC textures may be distorted after being mapped onto the parametric surfaces, since a curvilinear grid usually consists of cells of di erent sizes. This paper proposes a way for solving the problem through using multi-granularity noise as the input image for LIC. A stochastic sampli...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Image and Vision Computing

سال: 2021

ISSN: ['0262-8856', '1872-8138']

DOI: https://doi.org/10.1016/j.imavis.2021.104286